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Therapeutic Area 
#1 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Map 
Location 



x> 
r-i 

W 
IN 


Allele 
Freq. 
(pred.) 

good 7 of 
20 

good 4 of 
8 

9 of 30 

44 of 1050 

p value 


s 

© 
PI 

1 30E-O6 

1 70E-1 ! 

Name of protein identified following a 
BLASTX analysis of the CuraGen 
sequence 

Human Gene SWISSPROT-ID PI 7858 6- 
PHOSPHOFRUCTOKINASE, LIVER 
TYPE (EC 2 7 1 11) 
(PHOSPHOFRUCTOKINASE 1) 
(PHOSPHOHEXOKINASE) 
(PHOSPHOFRUCTO- 1 -KINASE 
ISOZYME B) - HOMO SAPIENS 
(HUMAN), 780 aa 

Human Gene SWISSPROT-ID P17858 6- 
PHOSPHOFRUCTOKINASE, LIVER 
TYPE (EC 2 7 1 11) 
(PHOSPHOFRUCTOKINASE 1) 
(PHOSPHOHEXOKINASE) 
(PHOSPHOFRUCTO- 1 -KINASE 
ISOZYME B) - HOMO SAPIENS 
(HUMAN), 780 aa 

Human Gene Similar to SWISSPROT- 
ACC P07148 FATTY ACID-BINDING 
PROTEIN, LIVER (L-FABP) - Homo 
sapiens (Human), 127 aa 

Calpactm 

O J) 

.2 cj} 

.5 I J 
s - s « 
2 S a 
&, ~ U 

kinase 

kinase 

UNCLASSIFIE 
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Type of 
Change 

& 
H 
D 

CONSERV 
ATIVE 

CONSERV 
ATIVE 

CONSERV 
ATIVE 

Amino 
Acid after 


Ser 

Ala 

Val 

Am ino 

Acid 

before 


Cys 


Ala 

Base 
after 

00 

GC (10) 

T(12) 

T(14) 

Base 
before 

G(7) 

CG(9) 

C(H) ! 

C(13) 

Polymorphic 
Sequence 

CT-A- 

TCCCTGGCCACC 

TG-CCAGGCCTCC 

CTC[G/C]GGCTGG 

TGTCTT-GAGA- 

CCA-GCCTG- 

CCAGGCCC 

ATCATCCAGCTG 

GGCGGCACTATC 

ATTGGCAGCGCT 

CGCT[CG/GC]AAG 

GCCTTTACCACC 

AGGGAGGGGCG 

CCGGGCAGCGG 

TTGAGTTCGGTC 

ACAGACTTGATG 1 

TTTTT-GA- 

AAG[C/T]TGTCAC 

CAGTTTATTGTCA 

CCTTCCAACTGA 

ACCACTGTCTTG 
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Is 111 
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ca ti o o o so 

c3 O O « O0 CD 

Base 
pos. of 
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3171 
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CI 
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Calling 
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cg449 12878 
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IS 


33 


Therapeutic Area 
#2 


Immunology 

Immunology 


Therapeutic Area 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Map 
Location 




IN 

"5. 

Allele 
Freq. 
(pred.) 

8 of 900 

3 of 260 

6 of 200 

6 of 100 

p value 


2 20E-11 

3 OOE-10 

8 50E-10 

Name of protein identified following a 
BLASTX analysis of the CuraGen 
sequence 

Calpactin 

CD151 

CD151 

Clustenn/ApoJ 

Protein 

classiflcation of 
CuraGen gene 





Type of 
Change 

NON- 

CONSERV 

ATIVE 

NON- 

CONSERV 

ATIVE 

NON- 

CONSERV 

ATIVE 

Silent- 

Codmg 

Amino 
Acid after 

Asp 

Tyr 

Pro 


Amino 

Acid 

before 

Gly 

Cys 

His 


Base 
after 

A (32) 

A (34) 

<n 
U 

T(38) 

Base 
before 

G(31) 

G(33) 

A (35) 

C(37) 

Polymorphic 
Sequence 

Atgccatctcaaatggaac 
acgccatggaaaccatgat 
gtttacatttcacaaattcgct 
ggggataaag[g/a]ctactt 
aacaaaggaggacctgaga 
gtactcatggaaaaggag 

so « £ ai « 
5P 3) ob tS g 

111 f| 

Ml O « O O £P 

tgtgaccagcgctgtggacc 
agctgcagcaggagttcc[a 
/c] ctgctgtggcagcaaca 
actcacaggactggcgaga 
cagtgagtggatccg 

TAC-G-AGAAGG- 
CGAC-G-ATG-A- 
CCGG- 

ACTGTGTGCCGG 

G-AG-A-T- 

CCG[C/T]CACAA- 

CT-CCACGGG- 

CTGCC-T-GCGGA- 

T-G-AA-GG-ACCA- 

GTG-T-G-AC 

J s £ 

so &w 

r-t 

o 
o 
o 

% 

1446 

Sequence 
Calling 
Asseni bly 

95124747 

91234048 

91234048 

94218949 

SEQ ID 

31,32 

33, 34 

35, 36 

37, 38 


Therapeutic Area 
#2 




Therapeutic Area 

Metabolic/endocn 
ne/cardiovascul ar 

Metabolic/endocn 
ne/cardiovascul ar 

Metabolic/endocn 
ne/cardiovascular 

Map 
Location 

12q24 1 


o 

(N 

If? 
< £ a 

3 of 130 

5 of 600 

24 of 120 

p value 

■* 

i 

U3 

2 10E-14 

Name of protein identified following a 
BLASTX analysis of the CuraGen 
sequence 

SercAl 

Valosin-containmg protein 

Glycogen Phospliorylase Muscle 

Protein 

classification of 
CuraGen gene 




Change 

NON- 

CONSERV 

ATIVE 

NON- 

CONSERV 

ATIVE 

N ON- 
CON SERV 
ATIVE 

Amino 
Acid after 

Val 

Ser 

Ser 

Amino 

Acid 

before 

Met 

Ala 

Ala 

Base 
after 

a 
O 

T(42) 

T(44) 

Base 
before 

A (39) 

G(41) 

g(43) ; 

Polymorphic 
Sequence 

GTCAA-TGTCGG- 

TTTAC-TG- 

TACACCA-AA- 

TAAA-CCA- 

AGCAGG-AC- 

ATCAA-TGAGC- 

AA — G[A/G]T- 

GTTTGTGAA-GGG 

TG-CTCC-TG-AA- 

GG-TG-TCA- 

TTGACA-GG-T- 

GCACCC-A-CA- 

TTCGA-G-TT 

gatgacattggtggctgcag 
gaagcagcta[g/t]ctcag 
ataaaggagatggtggaact 
gcccctgagacatcctgccc 
tcttaaggcaattgg 

AA-TG-ATAAC-TT- 

C-TTTG--A-GGGG- 

AA-GG-A-GCTG-C- 

GG-C-TG-AAGC-A- 

GG-AGT-ACTT-C- 

G-T-GGTGG- 

CC[G/T]CCA- 

CGCT-C-C-AGGAC 

AT-C-AT-CCGC-C- 

G-C--TTCAAGT-C- 

G-T-CCAAG- 

TTCGG-CTG- 

CCGGGA-CCC-TG 

Base 
pos. of 
SNP 

1876 

1132 

1232 

Sequence 

Calling 

Assembly 

95351416 

97873686 

95292679 

SEO ID 

39, 40 

41,42 

43,44 


Therapeutic Area 



Immunology 


Therapeutic Area 

Hi 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Map 
Location 


£ 


n 

1 i 1 

6 of 10 

7 of 45 

91 of 250 

5 of 50 

p value 

6 50E-17 




Name of protein identified following a 
BLASTX analysis or the CuraGen 
sequence 

Pyruvate dehydrogenase kmase-like 
protein 

Galactostdase sialotransferase 

CD151 

Rab5 -interacting protein 

Protein 

classification of 
CuraGen gene 





Type of 
Change 

Silent- 
Codmg 

SILENT 

NONCODI 

NG 

Silent- 
Coding 

SILENT 

NONCODI 

NG 

Amino 
Acid after 





Amino 

Acid 

before 





Base 
after 

C(46) 

C(48) 

A (50) 

G(52) 

Base 
before 

A (45) 

T(47) 

T(49) 

m 
CJ 

Polymorphic 
Sequence 

TGATGGATAATT 

CCCGGAATGCTC 

CTTTGGCTGGTTT 

TGGTTACGGCTT 

GCCAATTTCTCGT 

CTGTATGC[A/C]A 

AGTAC-TTTCAA- 

GGAGATCT-GAAT 

CTC-TACTC- 

TTTAT-C-AGGA- 

TATGG-AACAGA- 

TGCTAT 

CT- 

GGACCTGATTT- 
CC- 

TGACCACAGGC- 

TCTTGAAG[T/C]- 

CCCCATGGT- 

CTTGCTGAC-AGA 

GG- 

CCCCTAGAGTAA 
AAGGAGC 

ggggagcttctgtccacctg 
tcctgcagaggagtcgtttc 
cagcccggc[t/a]gcccca 
ggatgggtgagttcaacga 
gaagaagacaacatgtggc 
accgttt 

ATTAAAGATTTG 

ATTTATTCAAGTA 

TGTGAAAACATT 

CTACAATGGAAA 

CT[C/G]TTATTAG 

ATGCTGCATGTA 

CTGTGCTATGGA- 

CCAC-GCACAT- 

ACAGCC- 

ATGCTGTTTC- 

AGAAGAC 

Base 
pos, of 
SNP 

1353 

2804 

r- 
<m 

CM 

00 

Sequence 

Calling 

Assembly 

91231553 

95108682 

91234048 

94131544 

SEQ ID 

"* 

47, 48 

49, 50 

51,52 


Therapeutic Area 


Renal Diseae 



Therapeutic Area 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardio vascular 

Metabolic/endocn 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Map 
Location 

a. 

m 
a. 

Ip31 1-31 3 

| 

III 

3 of 12 

9 of 30 

4 of 1 1 

25 of 90 

p value 





Name of protein identified following a 
BLASTX analysis of the CuraGen 
sequence 

Adipocyte-specific protein 

Valosin-contaming protein 

Medium Chain Acyl Coa Dehydrogenase 

Creatine Kinase Muscle 

Protein 

classification of 
CuraGen gene 





Type of 
Change 

SILENT 

NONCODI 

NG 

SILENT 

NONCODI 

NG 

SILENT 

NONCODI 

NG 

SILENT 

NONCODI 

NG 

Amino 
Acid after 





Amino 

Acid 

before 





Base 
after 

C(54) 

T(56) 

C(58) | 

gap (60) 

Base 
before 

G(53) 

O 

T(57) 

C(59) 

Polymorphic 
Sequence 

CACCTC-CCT- 
CACCACACAGGA 
CCCTGAGT- 
GAGGA- 

GGAGGGGCTGGA 

AACCTGGG[G/C]T 

GGGTTGGCCAAA 

GGAGAACCTCAG 

GCTCCTGGCCTG 

GCCCAGCTCCTT 

CCTGCCCAAGGT 

AGCTTAGCCCAT 

CC 

CACAGCCTGCTC 
CAriCICCAG- 
TCTGAACAGTTC 
AGCTA- 

CAGTCTGACTCT 

GGACA- 

GGG[G/T]GTTT- 

CTGTTGC- 

AAAAATACAAAA 

CAAAAGCGATAA 

AATAAAAG- 

CGATTTTCATTT 

AGCTTGCCTTAA 

ATTATTTTTATAT 

GACTGTTGGTCT 

CTAGGTAGCC- 

TTTGGTCTATTGT 

ACACAA[T/C]CTC 

ATTTCATATGTTT 

GCATTTTGGCAA 

AGAACTTAATAA 

AATTGTTCAGTG 

aaggccaccatgcttttattta 
tcgctttg[c/gap]tggaga 
caaagcacaagctccgagt 
gtgctgggagctctccatta 
actagag 

M CM 

On 
O 

o\ 

3429 

2183 

3121 

Sequence 

Calling 

Assembly 

95343665 

97873686 

97978029 

95289295 

SEQ ID 

53, 54 
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Renal Dis* 




CNS Dis< 

Therapeutic Area 
#1 

Metabolic/endocri 
ne/cardiovascular 

Metabolic/endocn 
ne/cardiovascular 

Metabohc/endocn 
ne/cardiovascular 

Metabohc/endocn 
ne/cardiovascular 

Map 
Location 



CM 

"a- 

•■a 

Allele 
Freq. 
(pred.) 

46 of 200 

o 
m 

O 
(N 

4bp 

insertion 
polymorph 
ism 3 of 
10 

2 of 130 

alue 


1 60E-07 

10E-254 


»■ 
a. 


■+ 


led following a 
e CuraGcn 




'ROT-ID Q07869 
LIFERATOR 
'TOR ALPHA 
)MO SAPIENS 
Is SPTREMBL- 
OME 

HWATED 
-HOMO 

, 468 aa (fragment) 

£ 

Name of protein identH 
BLASTX analysis ofth 
sequence 

"1 

hpocortin 1 


Human Gene SWISSI 
PEROXISOME PRO! 
ACTIVATED RECEI 
(PPAR-ALPHA) - HC 
(HUMAN), 468 aa |pc 
ID Q 16241 PEROXIS 
PROLIFERATOR AC 
RECEPTOR ALPHA 
SAPIENS (HUMAN) 

Acyl Co A Binding Pn 

O 

J & 
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■ill 

I R 3 
O- ~ U 

Kinase 



~a 
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Peptid 
hormo 

Change 

SILNET 
NON- 
CODING 

Termination 

Frameshift 

NON- 

CONSERV 

ATIVE 

lino 
id after 


fOP 



1 

< < 


£ 




Amino 

Acid 

before 


Gin 


Lys 


i(78) 

o 1 

OO 


O 

< ^ 

00 

« a 

DO « 


H 


o ss 

< 

Base 
before 

A (77) 

C(79) 

gap (81) 

T(83) 

Polymorphic 
Sequence 

TTTATACAATAC 

AT-ACAATTA- 

TCA-GG-AATG-C- 

AAAAAAAAAA[A/ 

gap]CATAAATAAT 

GCCCATTTT-A- 

CA-GG-TG-A-C-A- 

TTTTAAA-C-AA- 

TG-AAAAA-C- 

ACCAACGG 

AAAGGTGTGGAT 

G-AAGCAACCAT- 

CATTGACA- 

TTCTA-ACTA-AG- 

CGAAACAATGCA 

[C/T]AGCG- 

TCAACA- 

GATCAAAGCAGC 

ATAT-C-T-CC 

AGAAACAA- 

ATGCCAG-TATTG- 

TC-GATTT- 

CACAAGTGCCTT 

TCTGTCGGGATG 

TCACACAACG[ga 

p/GTAG]CGATTCG 

TTTTGGACGAA- 

TGCCAAGATCTG 

AGAAAGCAAAAC 

TGAAAGCAGAAA 

CGAT-GG-CT-T- 

GG-TCTT-A- 

AGGTGCCT-A-A- 

CCTCCTCT- 

GC AGC [T/A]TT- 

CTCAAAC-T-CAG- 

CCTGAGA- 

CATCCT-GG-C-C- 

GACTT- 

GCAAGAACT- 

CCA 

o 

I *S 

CO a. <J} 

00 
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t» 

v> 

r- 
t-> 
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\o 
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11057 

|f! 

07234 

>3086< 
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^1067 
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Therapeutic Area 
HI 

Oncology 

Oncology 

I 

< 

V 

•G 
9 

4> 

a. 

1 

H * 

Metabol ic/endocn 
ne/cardiovascular 

Metabolic/endocri 
ne/cardiovascular 

Map 
Location 

6p24 1-25 3 

i 

Allele 
Freq. 
[prcd.) 

3 of 200 

8 of 50 

p value 



Name of protein identified following a 
BLASTX analysis of the CuraGen 
sequence 

DBI-retated Protein 

DBI-related Protein 

Protein 

classification of 
CuraGen gene 

Peptide 
hormone 

Peptide 
hormone 

Type of 
Change 

NON- 

CONSERV 

ATIVE 

Conservativ 
e 

Amino 
Acid after 

Ser 

Ala 

Amino 

Acid 

before 

Asn 

Val 

Base 
after 

G(92) 

T(94) 

Base 
before 

(16)V 

C(93) 

Polymorphic 
Seouencc 

ATA-TT-CC-CC-C- 
T-G-GT-GG-AG-T- 
AG-AG-GAG- 
AAAG-CTA- 
AAAATA[A/G] TG- 
CC-GT-TTTA-C- 
TG AGGG A A-T-TT- 
G~T- 

GGGCTGTTTTAT 
AGATTTT 

C-AC-TTTT-C-AG- 

AAAGAAG- 

TCTGGA-- 

CCAGGC-T-GAA- 

GGCA-TTTGC- 

AAAGCTT— 

CCCCC-AAAT- 

G[C/T] CTTG-AG- 

AATTT-C-AAAAG- 

AGG-TAAT-CA- 

GG-AAAAG- 

AGAGAG-A--G- 

AAAAACTACAC- 

GCT-GTT-AATG-C- 

TGA-AGAATG- 

CAAT-G-T-CC- 

TTCAG 

Base 
pos. of 
SNP 

oo 

1309 

Sequence 
Calling 

\o 

VO 

oc 

I 
8* 

cgl 0888 1866 

£ 

C 
&. 

GT 

91,92 

93,94 
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